Column 18, line 16 (Page 26, line 5): "stra+ gene" should read --strA+ gene--.

Column 19, line 36 (Page 28, line 3): "lacz" should read --lacZ--.

Column 26, line 49 (Page 38, line 26): "calorimetrically" should read --colorimetrically--.

Column 31, line 46 (Page 45, line 30): "stra+ gene" should read --strA+ gene--.

Column 33, line 8 (Page 47, lines 30-31): "20 µl kanamycin" should read --20 µg/ml kanamycin--.

Column 34, line 21 (Page 49, line 22): "pGenDel 1 pXRD4043" should read --pGenDel1. pXRD4043--.

Column 49, line 30 (Page 71, line 18): "EcoRi site" should read --EcoRI site--.



A true and correct copy of Terminal Disclaimer filed February 23, 2004 and pages 26, 28, 

In some embodiments, the vector may contain more than one copy number
indicator. For example, as shown in Figure 1 and SEQ ID NO:1, pGenDel1 contains two
copy number indicators. One of the copy number indicators is the rpsL gene (GenBank
Accession number J01688, the disclosure of which is incorporated herein by reference),
which is also referred to as the strA+ gene. In pGenDel1, the strA+ gene is located
between base 6574 and base 7218 of SEQ ID NO:1, between the HpaI end of the
pBeloBAC11 fragment and the left end of Tn1000. However, those skilled in the art will
appreciate that the copy number indicator is not limited to this location and may be located
at any position in the vector consistent with its intended purpose. Similarly, it will be
appreciated that the copy number indicator may be any copy number indicator consistent
with its intended purpose, and that the copy number indicator is not limited to those
specifically listed herein.

The strA+ gene in pGenDel1 has been modified to change the base corresponding
to position 342 of the strA+ gene of GenBank Accession number J01688 (position 7058 in
pGenDel1) from A to T. This modification does not affect the amino acid sequence of the
translated protein but removes a PmlI site in order to maintain the PmlI site in the high
copy number origin of replication as a unique site. The present vectors utilize the new
observation that when the strA+ gene is present in high copy number, it confers
streptomycin sensitivity on cells which are normally streptomycin resistant. Thus, prior to
cloning of an insert into the high copy number origin of replication, host cells which are
normally resistant to streptomycin will be sensitive to streptomycin as a consequence of
the high copy number of the strA+ gene. When an insert is cloned into the high copy
number origin of replication, thereby reducing the copy number of the vector, the cells will
become resistant to streptomycin. Thus, streptomycin resistance may be used to positively
select cells containing vectors with inserts in the high copy number origin of replication.
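
The idea of a single base change that removes a restriction site without altering the encoded protein can be illustrated with a minimal sketch. The 15-base open reading frame below is a hypothetical toy sequence chosen for illustration, not the rpsL/strA+ sequence of J01688; only the PmlI recognition sequence (CACGTG) and the standard genetic code are taken as known.

```python
# Toy illustration only: toy_orf is a hypothetical sequence, NOT the strA+ gene.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

# Standard genetic code, built from the conventional TCAG ordering.
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

PMLI_SITE = "CACGTG"  # PmlI recognition sequence


def translate(orf: str) -> str:
    """Translate an in-frame DNA string codon by codon."""
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf) - 2, 3))


def substitute(seq: str, index: int, new_base: str) -> str:
    """Return seq with the base at a 0-based index replaced."""
    return seq[:index] + new_base + seq[index + 1:]


toy_orf = "ATGGCACGTGAATAA"          # contains CACGTG at bases 5-10 (1-based)
mutant = substitute(toy_orf, 5, "T")  # A -> T at the third position of codon GCA

print("PmlI site before/after:", PMLI_SITE in toy_orf, PMLI_SITE in mutant)
print("Protein before/after:  ", translate(toy_orf), translate(mutant))
```

In this toy case the A to T change falls on the third position of an alanine codon (GCA to GCT), so the site is lost while the translation is unchanged. Whether the actual change at position 342 sits in the same codon context is not stated in the text.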

In addition to the strA+ gene, pGenDel1 contains a second marker indicative of the
copy number of the vector in the host cell. This second marker is a truncated version of
the lacZ gene which was obtained as follows. The double stranded synthetic linker HE1,
having the sequence AGCTACGGGAAAGCC (SEQ ID NO:4)/
HE2: AATTGGCTTTCCCGT (SEQ ID NO:20), was ligated to pUC19 DNA which had
been digested with HindIII and EcoRI, thereby eliminating the HindIII and EcoRI sites in
pUC19. When transformed into bacterial host cells, the modified pUC19 plasmid gives

be used as a deletion indicator which allows identification or selection of cells in which
transposition has generated a deletion within the insert in the high copy number origin of
replication. The use of the truncated lacZ gene for this purpose is described below.
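
The HE1/HE2 linker used in constructing the truncated lacZ marker can be checked computationally. The sketch below assumes the two oligonucleotides anneal through their last 11 bases, leaving 4-base 5' overhangs, and that each digested vector end retains the first base of its site (HindIII cuts A^AGCTT, EcoRI cuts G^AATTC). The variable names are illustrative only.

```python
# Sketch: confirm the linker is self-consistent and that ligation
# does not regenerate either restriction site.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    return seq.translate(COMPLEMENT)[::-1]


HE1 = "AGCTACGGGAAAGCC"  # SEQ ID NO:4
HE2 = "AATTGGCTTTCCCGT"  # SEQ ID NO:20
OVERHANG = 4             # assumed length of each 5' single-stranded end

HINDIII_SITE = "AAGCTT"
ECORI_SITE = "GAATTC"

# The bases after each overhang should pair with each other.
duplex_is_complementary = HE1[OVERHANG:] == reverse_complement(HE2[OVERHANG:])

# Overhangs should match the sticky ends left by HindIII and EcoRI.
hindiii_compatible = HE1[:OVERHANG] == HINDIII_SITE[1:5]
ecori_compatible = HE2[:OVERHANG] == ECORI_SITE[1:5]

# Junction = base retained on the vector end + first five linker bases.
hindiii_junction = HINDIII_SITE[0] + HE1[:5]
ecori_junction = ECORI_SITE[0] + HE2[:5]

print("duplex complementary:", duplex_is_complementary)
print("HindIII junction:", hindiii_junction, "regenerates site:", hindiii_junction == HINDIII_SITE)
print("EcoRI junction:  ", ecori_junction, "regenerates site:", ecori_junction == ECORI_SITE)
```

Under these assumptions both junctions differ from the original sites in their final base, which is consistent with the statement that ligation of the linker eliminates the HindIII and EcoRI sites.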

D. Vector Maintenance Marker 

The vectors of the present invention also include a marker for selecting host cells
which contain the vector. This selectable marker may be any of a variety of selectable
markers known to those skilled in the art. In particular, the selectable marker may confer
resistance to a drug, biomolecule, metabolite, or other agent.

In pGenDel1, the marker for selecting host cells harboring the vector is the kanR
gene derived from bases 720 through 1535 of the plasmid pUC4K (GenBank Accession
No. X06404, the disclosure of which is incorporated herein by reference) available from
Pharmacia. The kanR gene is located between bases 9 and 1266 of pGenDel1. However,
those skilled in the art will appreciate that the marker for selecting host cells harboring the
vector is not limited to this location or this particular marker. Instead, the marker for
selecting host cells harboring the vector may be any marker consistent with its intended
purpose and may be located at any position in the vector consistent with its intended
purpose.

In the pGenBac vectors, the vector maintenance marker is the chloramphenicol
acetyl transferase gene, which confers resistance to chloramphenicol. Those skilled in the
art will appreciate that a number of other plasmid maintenance markers may also be used.
The present invention contemplates the use of any plasmid maintenance marker consistent
with the purposes described herein.

E. Insert Amplification Primers 

In a preferred embodiment, hybridization sites for insert amplification primers are
located on each side of the cloning sites in the high copy number origin of replication. The
insert amplification primers may have any sequence complementary to a sequence near the
cloning sites in the high copy number origin of replication. Preferably, the insert
amplification primers are at least 15 nucleotides in length. More preferably, the insert
amplification primers are between 15 and 25 nucleotides in length.

The hybridization sites for insert amplification primers may be from about 20 to
about 800 bases from the cloning sites in the high copy number origin of replication.
Preferably, the hybridization sites for insert amplification primers are from about 20 to

VECTORS DESIGNED FOR GENERATING NESTED
DELETIONS USING ENZYMATIC METHODS
The features of high throughput sequencing vectors which are enzymatic deletion
vectors are described below. Each of the vectors for generating nested deletions using
enzymatic methods has the features listed below. While pGenDel1 is used as a
representative of this class of vectors, it will be appreciated that this class of vectors need
not include all the features of pGenDel1. Instead, this class of vectors comprises the
features listed below.

The vectors for generating nested deletions using enzymatic methods contain a 
high copy number origin of replication having cloning sites therein, a low copy number 
origin of replication and any controlling genes required for the activity of the low copy 
number origin of replication, a deletion indicator and at least one copy number indicator as 
described above. The enzymatic deletion vectors may also have hybridization sites for 
primers for creating substrates for enzymatic deletions, hybridization sites for primers for 
amplifying deletions, and hybridization sites for primers for sequencing deletions. In 
addition, deletion vectors may also have the other features described above, such as a 
vector maintenance marker, hybridization sites for insert amplification primers, 
hybridization sites for insert sequencing primers, and a single stranded origin of 
replication. 

A. Deletion Indicator 

Preferably, the vectors designed for use in enzymatic methods for generating 
nested deletions have at least one deletion indicator for indicating when an enzymatic 
deletion has progressed into the insert in the high copy number origin of replication. The
deletion indicator is preferably a marker for which the loss thereof through enzymatic 
deletion is readily detectable. The marker may be a selectable marker or a readily 
assayable protein, such as a colorimetrically detectable protein such as beta galactosidase. 
The deletion indicator is preferably of short length to minimize the recovery of deletions 
which have deleted enough of the deletion indicator to destroy the activity of the deletion 
indicator but have not progressed into the insert in the high copy number origin of 
replication. In those embodiments which are designed for generating nested deletions 
using both enzymatic and transposition-based methods, the deletion indicator for 
determining whether an enzymatic deletion has proceeded into the insert may also act as a 

copy number replication. The high copy number origin of replication facilitates the
preparation and manipulation of vector DNA, a crucial feature which limits the
applicability of the current oriS containing plasmids.

Cells carrying the pGenDel1 vector without an insert are kanamycin resistant,
streptomycin sensitive, sucrose sensitive, and dark blue on Xgal and IPTG. As described
in more detail below, cells carrying pGenDel1 derivatives in which inserts have been
cloned into the cloning sites in the high copy number origin of replication are kanamycin
resistant, streptomycin resistant (where streptomycin resistant host cells are used), sucrose
sensitive, and light blue on Xgal and IPTG. Moreover, aberrant plasmids resulting from
rearrangement or degradation during the cloning steps usually lack an intact lacZ gene and
are easily identified due to their white coloration in the presence of IPTG/Xgal.
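
The phenotypes listed above amount to a small decision table. The sketch below encodes it, assuming a streptomycin resistant host; the `Colony` class, the `classify` function and the returned labels are hypothetical names introduced for illustration. Sucrose sensitivity is omitted because it is the same with and without an insert.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Colony:
    grows_on_kanamycin: bool
    grows_on_streptomycin: bool
    color: str  # "dark blue", "light blue" or "white" on Xgal/IPTG


def classify(colony: Colony) -> str:
    """Map an observed phenotype to the vector state described in the text."""
    if not colony.grows_on_kanamycin:
        return "no vector"
    if colony.color == "white":
        return "aberrant plasmid (lacZ not intact)"
    if colony.grows_on_streptomycin and colony.color == "light blue":
        return "vector with insert (low copy number)"
    if not colony.grows_on_streptomycin and colony.color == "dark blue":
        return "vector without insert (high copy number)"
    return "unexpected phenotype"


print(classify(Colony(True, True, "light blue")))
print(classify(Colony(True, False, "dark blue")))
print(classify(Colony(True, True, "white")))
```

Checking color in addition to streptomycin resistance is what lets a colony that grows on streptomycin for some other reason be told apart from a genuine recombinant.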

Upon cloning an insert into the cloning sites in the high copy number origin of
replication, the function of the high copy number origin of replication is disrupted and
replication of the vector is directed by the low copy number origin of replication. In
addition to simplifying the generation and sequencing of nested deletions using
transposition based methods or enzymatic methods as discussed below, the low copy
number of the resulting recombinants allows the stable cloning of fragments from several
hundreds of bases to several hundreds of kilobases in length and permits the maintenance
of sequences which are unclonable in high copy number.

In pGenDel1, the unique PmlI and ScaI blunt ended cloning sites in the high copy
number origin of replication (ori pUC19 derived) allow the cloning of inserts at these
sites, which in turn provokes a drastic change in the number of molecules per cell. The
strA+ gene carried by pGenDel1 renders streptomycin-resistant recipient cells sensitive to
streptomycin when present at a high copy number. Thus, streptomycin-resistant cells
transformed by a pGenDel1 plasmid without an insert in the high copy number origin of
replication (ori pUC19) will be sensitive to streptomycin and unable to grow on
streptomycin-containing media. However, cells containing recombinant pGenDel1
plasmids with inserts in the high copy number origin of replication (ori pUC19) will be
present at low copy numbers and will be able to grow in the presence of streptomycin.

Cells containing rare spontaneous mutations which inactivate the strA+ gene
would also grow on streptomycin. However, such cells can be effectively detected and
rejected using the truncated lacZ color-based selection system. Cells containing

The resulting fractions of 4-6 kb size were excised and DNA was extracted by
electroelution and subsequent concentration on Microcon 100 columns.

The ends of the resulting DNA fragments were repaired by treatment with 4 units
of Vent DNA polymerase (New England Biolabs) in the presence of 125 µM of all four
dNTPs, for 20 minutes at 72°C in a final volume of 20 µl of ThermoPol buffer supplied by
the manufacturer. The DNA was extracted with chloroform, precipitated in ethanol and
resuspended in 10 mM Tris-HCl pH 7.5, 1 mM EDTA. DNA concentrations were
estimated by relative fluorescence of agarose gels containing lambda phage DNA samples
of known concentrations.

B. Preparation of Vector DNA

pGenDel1 vector DNA was extracted from overnight cultures by the standard
alkaline lysis procedure. In order to check the integrity of the vector, 1 µl of the overnight
cultures was plated on LB agar containing either sucrose and kanamycin or streptomycin
and kanamycin. Only cultures giving no more than 5-10 resistant colonies in these tests
were used for further DNA extraction. Supercoiled plasmid was obtained by ethidium
bromide/CsCl gradient centrifugation. 2 µg of the resulting plasmid DNA was then
digested with 10 units of ScaI enzyme, in conditions recommended by the manufacturer
(New England Biolabs). After adding EDTA at a 10 mM concentration, the vector DNA
was extracted with phenol, and concentrated by Microcon 100 centrifugation.

C. Ligation of Insert DNA into Vector and Selection of Clones Having Inserts

10 ng of pGenDel1 vector DNA was then ligated in a 10 µl reaction volume, with
50 ng of the BAC fragment DNA, in the presence of 200 units of T4 DNA ligase
(Epicentre) according to manufacturer's recommendations, at 12°C overnight. Following
addition of 20 µg of bacterial tRNA, the ligation products were extracted with phenol and
precipitated in ethanol.

DH10B cells containing or lacking the transposase expression plasmid pXRD4043
were washed, and subjected to electroporation in the presence of the purified ligation
products, in a 40 µl final reaction volume. Following the addition of L broth, the mixtures
were incubated for 45 minutes at 30°C (to reduce transposition), and plated in 20 µl
aliquots on the surface of 100 mm Petri dishes containing 2% agar-LB, 20 µg/ml
kanamycin, 20 µg/ml chloramphenicol (in procedures using DH10B cells containing
pXRD4043, which confers chloramphenicol resistance), 100 µg/ml streptomycin, IPTG



DH10B cells lacking the pXRD4043 plasmid, out of 36 colonies tested, 28 gave inserts of
4-6 kb, 3 contained inserts of less than 600 bp and 5 failed to amplify any product.

For DH10B cells containing the pXRD4043 plasmid, out of 192 colonies tested 19
had inserts of less than 400 bp in length, 26 had inserts between 400 bp and 1 kb in length,
5 had inserts between 1 kb and 5 kb in length, 130 had inserts between 4 and 7 kb in length,
and 12 gave no amplification product.

On average, 90%-95% of the colonies contained an insert and 70-80% of the 
colonies contained inserts in the expected size range of 4-7kb. 
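
The colony counts reported above can be turned into fractions for comparison with the quoted averages. The sketch below is a simple tally; the dictionary keys and the `summarize` helper are illustrative names, and a colony is counted as carrying an insert whenever it gave an amplification product, which is an assumption made here for the arithmetic.

```python
# Colony counts as reported in the text (size classes are the text's own).
with_pxrd4043 = {
    "<400 bp": 19,
    "400 bp-1 kb": 26,
    "1-5 kb": 5,
    "4-7 kb": 130,
    "no product": 12,
}
without_pxrd4043 = {
    "4-6 kb": 28,
    "<600 bp": 3,
    "no product": 5,
}


def summarize(counts: dict, expected_key: str):
    """Return (total colonies, fraction with a product, fraction in expected range)."""
    total = sum(counts.values())
    with_product = total - counts["no product"]
    return total, with_product / total, counts[expected_key] / total


for label, counts, key in [
    ("with pXRD4043", with_pxrd4043, "4-7 kb"),
    ("without pXRD4043", without_pxrd4043, "4-6 kb"),
]:
    total, frac_product, frac_expected = summarize(counts, key)
    print(f"{label}: n={total}, product {frac_product:.1%}, expected size {frac_expected:.1%}")
```

The per-experiment values fall near the stated ranges without matching them exactly in every case, as would be expected if the quoted averages summarize more than these two tallies.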

The ends of the inserts present in the PCR products were sequenced on ABI
automatic sequencers using the OS1 and OR1 primers.

The cloning and selection of pGenDel1 recombinants having inserts in the high
copy number origin of replication is summarized in the section of Figure 2B in which the
insert is cloned into pGenDel1. As indicated in Figure 2B, cells containing pGenDel1
derivatives containing an insert are kanamycin resistant, streptomycin resistant, sucrose
sensitive and faint blue on IPTG/Xgal plates. Thus, such cells can be selected and
identified on IPTG/Xgal plates containing kanamycin and streptomycin.

II. Generation and Analysis of Nested Deletions Using Transposition Based Methods

For the generation of nested deletions by transposition, pGenDel1 is introduced
into cells containing a source of transposase. As described above, the bacterial strain
DH10B, containing the plasmid pXRD4043 (Tsai, M.-M., Wong, RR., Hoang, A.T.,
Deonier, R.C. (1987) J. Bacteriol. 169, 5556-5562), was used for the initial cloning of
inserts into pGenDel1. pXRD4043 is a moderate copy number pACYC184 plasmid which
contains the tnpA gene under the control of a synthetic tac promoter. As a selectable
marker, pXRD4043 contains the chloramphenicol resistance gene. However, it will be
appreciated that the selectable marker on the plasmid which serves as a source of
transposase may be any of a variety of selectable markers, provided that the same
selectable marker is not on the vector containing the insert in which deletions are to be
generated.

Initially, the cells are grown at 30°C in order to restrict transposition. When cells
are cultivated at 37°C, the transposase becomes active and mediates the intramolecular
transposition of the Tn1000 ends within pGenDel1. Transposition based deletions which
result in linking the left transposon end to any position within the cloned insert will remove

sequences which may be toxic if present at a high copy number. These features may be
particularly advantageous when constructing genomic DNA libraries such as BAC
libraries.

Vectors useful in such applications comprise a high copy number origin of
replication having at least one cloning site therein which is positioned such that the ability
of the high copy number origin of replication to direct replication is lost when an insert is
cloned into the cloning site. In addition, the vectors comprise a low copy number origin of
replication and any genes necessary for the activity of the low copy number origin of
replication, at least one copy number indicator for indicating the copy number of the vector
in the host cells, and a vector maintenance marker for selecting cells containing the vector.
If desired, the vectors may also include hybridization sites for primers for amplifying the
insert and hybridization sites for primers for sequencing the amplified insert.

Exemplary vectors for maintaining inserts at low copy number (designated
pGenBac1 and pGenBac2) are shown in Figures 13 and 14. Each of these vectors contains
the high copy number origin of replication from pUC19 with a unique cloning site therein.
In pGenBac1 the unique cloning site in the high copy number origin of replication is an
EcoRI site, while in pGenBac2 the unique cloning site in the high copy number origin of
replication is a BamHI site. When an insert is cloned into the EcoRI site of pGenBac1 or
the BamHI site of pGenBac2, the activity of the high copy number origin of replication is
disrupted. However, as described above, vectors having inserts therein are capable of
replicating at low copy number under the control of oriS and its associated regulatory
genes.

pGenBac1 and pGenBac2 also contain the strA gene which permits the selection of
streptomycin resistant host cells carrying vectors having inserts therein as described above.
In addition, pGenBac1 and pGenBac2 contain the truncated lacZ gene described above as
a copy number indicator.

A gene conferring resistance to chloramphenicol is used as a vector maintenance
marker. In addition, pGenBac1 and pGenBac2 also contain hybridization sites for the
oriLRd and oriLRr PCR primers described above to permit amplification of the insert
DNA. Hybridization sites for the OS1 and OR1 sequencing primers are also included to
allow the amplified insert to be sequenced.

Example 7 describes the construction of pGenBac1 and pGenBac2.